NSF PAR Search | NSF Public Access Repository

Note: When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher. Some full text articles may not yet be available without a charge during the embargo (administrative interval).
What is a DOI Number?

Some links on this page may take you to non-federal websites. Their policies may differ from this site.

Learning Interpretable Concepts: Unifying Causal Representation Learning and Foundation Models

Rajendran, Goutham; Buchholz, Simon; Aragam, Bryon; Schölkopf, Bernhard; Ravikumar, Pradeep (December 2024, Advances in Neural Information Processing Systems (NeurIPS))

To build intelligent machine learning systems, there are two broad approaches. One approach is to build inherently interpretable models, as endeavored by the growing field of causal representation learning. The other approach is to build highly-performant foundation models and then invest efforts into understanding how they work. In this work, we relate these two approaches and study how to learn human-interpretable concepts from data. Weaving together ideas from both fields, we formally define a notion of concepts and show that they can be provably recovered from diverse data. Experiments on synthetic data and large language models show the utility of our unified approach.
more » « less
Full Text Available
Learning Linear Causal Representations from Interventions under General Nonlinear Mixing

Buchholz, Simon; Rajendran, Goutham; Rosenfeld, Elan; Aragam, Bryon; Schölkopf, Bernhard; Ravikumar, Pradeep (December 2023, Advances in Neural Information Processing Systems 36 (NeurIPS 2023) Main Conference Track)

We study the problem of learning causal representations from unknown, latent interventions in a general setting, where the latent distribution is Gaussian but the mixing function is completely general. We prove strong identifiability results given unknown single-node interventions, i.e., without having access to the intervention targets. This generalizes prior works which have focused on weaker classes, such as linear maps or paired counterfactual data. This is also the first instance of causal identifiability from non-paired interventions for deep neural network embeddings. Our proof relies on carefully uncovering the high-dimensional geometric structure present in the data distribution after a non-linear density transformation, which we capture by analyzing quadratic forms of precision matrices of the latent distributions. Finally, we propose a contrastive algorithm to identify the latent variables in practice and evaluate its performance on various tasks.
more » « less
Full Text Available
Learning Linear Causal Representations from Interventions under General Nonlinear Mixing

Buchholz, Simon; Rajendran, Goutham; Rosenfeld, Elan; Aragam, Bryon; Schölkopf, Bernhard; Ravikumar, Pradeep (December 2023, Neural Information Processing Systems (NeurIPS), 2023)

We study the problem of learning causal representations from unknown, latent interventions in a general setting, where the latent distribution is Gaussian but the mixing function is completely general. We prove strong identifiability results given unknown single-node interventions, i.e., without having access to the intervention targets. This generalizes prior works which have focused on weaker classes, such as linear maps or paired counterfactual data. This is also the first instance of causal identifiability from non-paired interventions for deep neural network embeddings. Our proof relies on carefully uncovering the high-dimensional geometric structure present in the data distribution after a non-linear density transformation, which we capture by analyzing quadratic forms of precision matrices of the latent distributions. Finally, we propose a contrastive algorithm to identify the latent variables in practice and evaluate its performance on various tasks.
more » « less
Full Text Available
Learning Linear Causal Representations from Interventions under General Nonlinear Mixing

Buchholz, Simon; Rajendran, Goutham; Rosenfeld, Elan; Aragam, Bryon; Schölkopf, Bernhard; Ravikumar, Pradeep (December 2023, Advances in Neural Information Processing Systems)

We study the problem of learning causal representations from unknown, latent interventions in a general setting, where the latent distribution is Gaussian but the mixing function is completely general. We prove strong identifiability results given unknown single-node interventions, i.e., without having access to the intervention targets. This generalizes prior works which have focused on weaker classes, such as linear maps or paired counterfactual data. This is also the first instance of causal identifiability from non-paired interventions for deep neural network embeddings. Our proof relies on carefully uncovering the high-dimensional geometric structure present in the data distribution after a non-linear density transformation, which we capture by analyzing quadratic forms of precision matrices of the latent distributions. Finally, we propose a contrastive algorithm to identify the latent variables in practice and evaluate its performance on various tasks.
more » « less
Full Text Available

Search for: All records